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1.   INTRODUCTION 

A  binary  tree  is  a  finite  set  of  nodes,  either  empty  or  containing  one  node 
called  a  root,  such  that  all  other  nodes  are  partitioned  into  disjoint  sets  which 
are  respectively  called  left  and  right  subtrees  of  the  root.   The  subtrees  also 
satisfy  the  definition  of  a  binary  tree.   Thus,  a  binary  tree  is  an  unlabelled 
rooted  arborescence  with  successors  of  at  most  degree  two  distinguished  only  as 
left  and  right. 

Figure  1.1  shows  a  binary  tree  with  six  nodes.   The  root  node  is  shown  at 
the  top  and  is  connected  by  arcs  to  two  immediate  successor  nodes  which  are  the 
roots  of  its  left  and  right  subtrees.   Each  node  with  no  successors  (for  in- 
stance, the  left  subtree  of  the  root)  is  called  a  leaf.   The  level  of  a  node 
indicates  how  deep  it  is  within  the  tree.   Thus  the  root  has  level  one,  its 
immediate  successor  nodes  have  level  two,  and  so  forth  down  the  occupied  por- 
tions of  the  subtrees.   The  height  of  a  binary  tree  is  the  largest  occupied 
level.   A  full  binary  tree  has  no  internal  vacancies  (unoccupied  node  positions) . 
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FIGURE  1.1  -  A  Binary  Tree  with  Six  Nodes 

Binary  trees  are  frequently  used  as  information  storage  structures  on 
digital  computers.   For  instance,  one  of  the  most  popular  methods  of  randomly 
retrieving  information  by  a  key,  or  symbol,  is  to  store  the  key  data  in  a 
binary  tree.   To  search  for  a  particular  symbol,  we  begin  by  looking  at  the 


root  and  proceed  by  applying  the  following  rules  recursively: 

1.  If  the  symbol  matches  the  root  symbol,  the  symbol  is  found. 

2.  If  the  symbol  is  "less  than"  the  root  (according  to  some  binary 
ordering  relation)  continue  the  search  by  considering  the  left 
successor  of  the  root  as  the  new  root  (of  the  left  subtree) . 

3.  If  the  symbol  is  greater  than  the  root,  continue  by  searching  the 
right  subtree. 

4.  If  there  is  no  root,  the  symbol  is  not  in  the  binary  tree. 

We  assume  for  simplicity  that  all  symbols  are  distinct  with  respect  to  the 
ordering  relation.   Otherwise,  the  ordering  relation  and  search  must  be  modi- 
fied in  an  obvious  fashion.   The  construction  of  a  binary  tree  for  use  by  such 
a  search  scheme  may  be  performed  by  sequentially  examining  the  key  symbols  to 
be  inserted.   This  binary  tree  sort  is  a  one  pass  ordering  procedure  which 
proceeds: 

1.  If  there  is  no  root,  insert  the  symbol  as  the  root. 

2.  If  the  symbol  is  less  than  the  root  symbol,  continue  by  considering 
the  left  subtree. 

3.  If  the  symbol  is  greater  than  the  root  symbol,  continue  with  the 
right  subtree. 

As  an  example,  consider  the  six  symbols  ABCDEF  and  a  lexicographical  binary 
ordering  relation.   Suppose  that  the  particular  permutation  of  symbols  ex- 
amined is  BDAFCE.   The  resulting  binary  tree  is  shown  in  Figure  1.2,  and  has 
structure  identical  to  the  tree  in  Figure  1.1.   Note  that  this  same  tree  may 
have  resulted  from  other  permutations  of  the  same  symbols,  for  instance 
BADCFE.   Therefore,  there  is  a  many- to-one  mapping  of  key  symbol  permutations 
to  corresponding  binary  trees. 
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FIGURE  1.2  -  A  Binary  Tree  with  Inserted  Symbols 


The  height  of  the  binary  tree  in  Figure  1.2  is  four  and  thus  the  maximum 
number  of  comparisons  required  to  insert  another  symbol  is  four.   Similarly, 
if  this  tree  is  used  for  retrieving  symbols,  the  maximum  search  length  for  a 
symbol  in  the  tree  is  four,  and  for  a  symbol  not  found  the  maximum  is  five. 

A  computer  implementation  of  a  binary  tree  storage  structure  requires 
that  each  node  be  represented  by  its  key  symbol  accompanied  by  sufficient 
additional  information  to  identify  and  access  the  left  and  right  subtrees. 
This  is  usually  accomplished  by  use  of  a  dense  array  of  node  symbols  each 
with  left  and  right  pointers,  by  node  storage  via  address  calculation  into  an 
array  space  sufficient  to  store  all  possible  binary  trees  with  a  given  number 
of  nodes  and  some  maximum  height,  or  by  some  similar  method. 

In  the  following  sections  we  study  this  widely  used  class  of  binary  trees 
in  order  to  provide  information  useful  in  examining  algorithms  based  on  this 
storage  structure.   A  closed  form  counting  formula  for  the  number  of  binary 
trees  with  n  nodes  and  height  k  is  developed  and  restated  as  a  recursion  more 
useful  computationally.   A  generating  function  for  the  number  of  nodes  given 
height  is  developed  and  used  to  find  the  asymptotic  distribution  of  binary 
trees.   An  asymptotic  probability  distribution  for  height  given  the  number  of 
nodes  is  derived  based  on  equally  likely  binary  trees.   This  is  compared  with 
a  similar  result  for  general  trees. 

Random  binary  trees  (those  resulting  from  the  binary  tree  sorting  algo- 
rithm applied  to  random  strings  of  symbols)  are  counted  in  terms  of  the  mapping 
of  permutations  of  n  symbols  to  binary  trees  of  height  k.   An  explicit  formula 
for  this  number  is  given  with  an  equivalent  recursive  definition  for  computa- 
tional use.   A  generating  function  is  derived  for  the  number  of  symbols  given 
height.   Lower  and  upper  bounds  on  random  binary  tree  height  are  developed  and 
shown  to  approach  one  another  asymptotically  as  a  function  of  n,  providing  a 
limiting  expression  for  the  expected  height. 


The  random  binary  trees  are  examined  further  to  provide  expressions  for 
the  expectations  of  the  number  of  vacancies  at  each  level,  the  distribution  of 
vacancies  over  all  levels,  the  comparisons  required  for  insertion  of  a  new 
random  symbol,  the  fraction  of  nodes  occupied  at  a  particular  level,  the  number 
of  leaves,  the  number  of  single  vacancies  at  each  level,  and  the  number  of  twin 
vacancies  at  each  level.   A  random  process  is  defined  for  the  number  of  symbols 
required  to  grow  a  tree  exceeding  any  given  height. 

Finally,  an  appendix  is  given  with  sample  tabulations  and  figures  of  the 
distributions . 

2.   NUMBER  OF  BINARY  TREES  OF  A  GIVEN  HEIGHT. 

In  this  section  we  consider  the  problem  of  finding  the  number  of  binary 
trees  with  n  nodes  and  height  k.  Denote  this  number  by  t(n,k)  ,  where  n 
and  k  are  positive  integers.   Since   t(n,k)  =  0  unless 

k  <_  n  £  2k  -  1  (2, 

we  are  only  concerned  with  integers   n  =  1,2,...;  k  =  1,2,...   satisfying  the 
inequality  (2.1) . 

An  explicit  formula  for  the  numbers   t(n,k)   can  be  obtained  by  the  fol- 
lowing simple  combinatorial  argument.   Consider  the  class  of  all  binary  trees 
with  n  nodes  and  height  k  which  have  exactly  m.   nodes  at  the  level  j+1, 

j=l,...,k-l.   Let  m,  *  Z.   +  r ,  ,  where  Z.      and  r.   are  numbers  of  nodes 

J    J    J  J        J 

which  are  left  successors  and  right  successors  of  nodes  at  the  level  j .   In 
other  words,  L.      is  the  number  of  nodes  at  level  j+1  at  the  end  of  left 
going  arcs  emanating  from  nodes  at  level  j .   These  can  be  selected  in    ^        \ 

/Vi\  ^  li  I 

ways,  and  the   r.   nodes  in     J      ways.   Thus,  the  total  number  of 
ways  to  arrange  the  arcs  between  nodes  at  levels   j   and  j+1   is  given  by 


£  £U 


J-l\/     J-l\     _      /        3-1,  (2<2) 


r .     /  \     m. 

J    /  \      3 

I . >0        r . >0 

£.+r.=m. 
J      J      3 

Since     m„   =   1      (the   root)    and     hl.    +•••+  m,    =  n-1     with     m.   ^_  1     for     j=l,  . .  .  ,k 
we  obtain  from   (2.2)    the   formula 

— eCOC^-M- 

where  the  summation  is  over  all  integers  m.  ,  j=l,...,k  satisfying 

m  >  1  ,  j=l, .. .,k-l  , 
and 

m..  +•  •  •+  m,    =  n-1  . 

The  formula  is  valid  for  n>l  and  k  satisfying  (2.1),  for  n=l  we  have 
trivially   t(l,l)  =  1. 

Although  (2.3)  is  an  explicit   formula  for  the  number   t(n,k)   it  is  not 
very  convenient  for  calculation.   An  alternate  way  is  through  a  recurrence. 

Let 

t(n,k)  =  T(n,k)  -  T(n,k-1)  ,   n>l,   k  >_  1  (2.4) 

where  T(n,k)   is  the  number  of  binary  trees  with  n  nodes  and  height  not 
exceeding  k.   If  we  define 

1  if  n=0 
T(n,0)   -  {  ,  (2.5) 

0  if  n>0 

and  T(0,k)  =  1  for  k  >  0  ,  we  obtain  the  recurrence  relation 


T(n+l,k+l)  s  /]  TCj,k)T(n-3,k)  , 
J-0 


(2.6) 


valid  for  n>0   and  k >0 .   This  follows  from  the  fact  that  the  class  of  all 
binary  trees  with  n+1  nodes  and  height  not  exceeding  k+1   can  be  partitioned 
into  n+1  subclasses  according  to  the  number  of  nodes   j   in  the  left  subtree 
of  root.   Since  the  heights  of  both  the  left  and  right  subtrees  must  not  exceed 
k  the  number  of  trees  in  the  j-th  class  is  the  product  T(j ,k)T(n-j ,k) ,  and 
(2.6)  follows. 

Note  that  with  the  convention  (2.5)  the  recurrence  (2.6)  yields  automatical 

T(n,k)  =0   for   n  >  2k  -  1  , 

and   that   for     0   <_  n   <_  k    ,      T(n,k)      is   just   the  number  of  binary   trees  with     n 
nodes.      It   is  well  known   (see    [Knuth,   vol.3])    that   the   latter   are   Catalan 
numbers 

Cn=     M\J    '      a>-°  (2-7: 

so   that 

T(n,k)    =   C  for        0   <_  n  <_  k.  (2.8; 

From  the  recurrence  (2.6)  one  easily  obtains  the  sequence  enumerators 

defined  by 

fk(x)  =  y^T(n,k)xn  ,   k^O  .  (2.9; 

n>0 

Since  the  right-hand  side  of  (2.6)  is  the  Cauchy  product,  we  have  immediately 

^  T(n+l,k+l)xn  =  f£(x)  , 
n>0 

from  which  in  view  of  (2.5)  we  obtain 


fk+1(x)  =  1  +  x  fk(x)  ,   k  >_  0  , 


(2.11 


with  fQ(x)  =  1  . 


Note   that      fv(x)      is    a  polynomial   in     x    ,   hence   if      x      is    regarded   as    a   complex 
variable      f,     ,   k=Q,l,...,      is    a  sequence   of   entire   functions. 

We  now  show  that   this   sequence   converges   uniformly   in   a  circular   region 
of    the   complex  plane,    specifically   that   as      k  •*■  <*> 

f,  (z)    ■*■  u>(z)        uniformly   for        |z|    <_    -r      ,  (2.11) 

where 


.   ,  V^  1     /2n\    n        1  -   /  l-4z  ,„    10. 

w(2)   =     lu      51 V    njZ     =  23 *  (2*12) 

n=0 

To   see   this,   note   that  with     C        as   in    (2.7)   we  have     T(n,k)    <   C        for  all     k 
'  n  —     n 

which   together  with    (2.8)    yields 

w(z)    -   fk(z)    =   zk     22  (CirT(n>k)>zn~k    » 

n>k 

so   that   for  I  z     <     -r 

i     i    _     4 

U(z)    -fk(z)|    <    £   Cn    W"     1    SCn4"n      > 

n>k  n>k 

which  is  a  tail  of  the  expansion   ^  (  T  )  =  2  . 

This  result  will  now  be  used  to  develop  an  asymptotic  distribution  of 
the  numbers   t(n,k)   as   k  -*■  °°  .   In  doing  this,  we  follow  the  method  of 
Renyi  and  Szekeres  used  in  [8]  for  a  similar  problem. 

From  (2.4)  we  have  for  k  >_  1,   t(0,k)  =  0  , 

^t(n,k)zn  =  fk(z)  -  fk-1(z)  ,  (2.13) 

n>0 

which  are  entire  functions  for  every  k  >_  1.   Hence  by  the  Cauchy  formula 

i    (  £k(z)  -  £k-i(z) 


where  we  take  the  circle   | z |  =  -r       as  the  contour  of  integration.   To  estimate 
the  integral  we  use  Laplace's  method  by  first  showing  that  as   k  ■*■  °°   the  only 
significant  contribution  of  the  integrand  is  in  the  vicinity  of  positive  real 
axis . 

1  *  A       0 

Let  FO,z)    *  1  +  j  eX<pz      ,      -ir   <    <(>   <_  tt  (2.15 

and   let   for      cj>      fixed 

Fk+1(<J),z)    =   F(<j),Fk((j),z))    ,      k   >   1    , 


F1(<j>,z)        =   F(<j,,z)     . 

Then 

fk(  f  e1^)    =   Fk(*,l)    ,  k  >   1    ,  (2.16 

i.e.    f.   are  iterates  of  the  function  F  .   For   z  =  -r  e     the  function 
k  4 

oj(z)   defined  by  (2.12)  satisfies 

F(<j),u)  =  oa  ,  (2.17 

and  is  given  by 

where  Re/  1-e   >_  0  .   The  curve   ojf-r  a),   — tt  <  4>  <_  tt   is  therefore  the 
curve  of  fixed  points  of  the  function  F($,»).   For  any  particular   <j>   the 
derivative 

—  F(sj),z)  =  F'(4>,z)  =  -j  e   z       so  that 
for  z  =  oj  we  have 


F'(4>,io)|  =  l-i  e^wl  =  ll  -  /l-e1 


2 


(2.19 


The   locus   of  points      /  l-eX^      ,    -tt   <   <£   <_  tt      in   the   complex  plane      is   an 

Lemniscate  of   Bernoulli  with  polar   equation     p      =2   cos    8,   while      1  -   /  l-eX<p 


just  shifts  the  curve  one  unit  to  the  right.   (See  Figure  2.1) 


t=7T 


1-/2 


FIGURE  2.1 


Hence 


F'U.uOl   :  1  , 


(2.20) 


with  equality  if  and  only  if   i=0.   Thus,  if   }>^0   all  fixed  points  (2.18) 
are  atractive  and  since   f,   are  iterates  of  the  function  F   converging  to 


u  this  implies  that  as  k  -*■  °° 


V^e1*)  ^(^e1*)  +  0(|l-  /T7*|k) 


(2.21) 


uniformly  for   N>   >  e    >   0 


The  modulus   |l  -  /  1-e      is  a  decreasing  function  of   |  <p  |   in 
®   -1  I  ?  I  —   m  with  maximum  1   at   $  ■  0.   Further  denoting   u  (q>)  +  iv  (£)  = 
=  1  -  /l-e  '    the  real  and  imaginary  parts  have  asymptotic  expansions 

VW-1-/T-  K/T)' +  °(*5'2)  • 


vi(4>)  =   + 


! 


(2.22) 


f  )"  +  0(,5/2 


as   J  (J)  J  ->  0  .   Hence  for  sufficiently  small 


1-  /l-e1*  I  <  1  -  -i  /7*L 


and  thus  if  we  choose  for  instance  \$\    < 


Zn2k 


we  obtain  from  (2.21) 


as   Lc  ■*■  °° 


^i-1*) -•(*•*♦)  ♦•(•■**) 


(2.23) 


Remark:   The  same  result  can  also  be  obtained  more  directly  by  writing 


fk+l(z)  '  aj(z)  =  1  +   Z  fk(z)  "  (1  "  Z  uj2(z)) 


=  z[fk(z)  -  aJ(z)][fk(z)  +  co(z)]. 

Then  choose  e  >  0  such  that   |w  \-r   e  )        >_   e   for  all  <f>  and  choose  K 

such  that  k  >_  K  =£»  |  f  ,(-r  e1(M  -  u(j-   elcp)  |  <  e  .   Then  with  z  =  -r  e1*  f 

k  >_  K 

f .  ,  -  -  a)  <  t  f.  -  o)  (2  L  +  e) 

1  k+1    '  —  4 '  k    '   '  ' 

wnence  for  all  m  >  1 


^-"1  i  l*s-»K 


1  -  /i-e1*!  +  O 
Hence  (2.2  3)  follows  by  the  same  argument  as  before. 


in 


1     i$ 


If  we  now  substitute    (2.23)    into    (2.14)    we  have  with      z   =  -^  e 


or 


«*.»-£/  frG-1*)  -^(K*) 


-inc 


d(J> 


-IT 


-s/Wi-^-'wa-1*)]-11*  *♦+*"«»(« 


,e"^)   , 


(W 


as  k  ->  °°  . 

To  estimate  the  remaining  integral  we  first  set   <j>  =  0   and  call 


(2.24) 


\=fk  4 


The  recurrence  (2.10)  yields 


hiak 


=  i  + 


an   =  1 


(2.25) 


(2.26) 


10 


from  which  by  substitution  y,  =  2  -  a,   we  obtain  the  recurrence 

Yk-KL  =  Yk  "iYk  '  Y0  =  X  '  (2-2 

1    o 

Thus   y,   is  the  k-th  iterate  of  the  function  £n(x)  =  x  -  7-  xz  ,  and  iterates 

k  4 

of  such  functions  can  be  handled  by  standard  methods  (see  e.g.  [de  Bruijn,  3, 
sec.  8.7  or  Exercise  8.11]).  We  obtain  the  asymptotic  expansion 

Yi,  *  r +  0(k  )  ,    or 


k   k 


k2 


fk(i)  =  2  "  ¥  +  ^""P  +  0(k  2)    aS    k  +  ~-  (2-28 

1  ,    /£n2k\2 
To  obtain  the  expansion  for  \$\    <     1  — — )    we  assume  that  it  has  the  form 


2     9 

where  £,   =  i<£k^  . 

For      cj>      fixed  we    then  have    L     -    =    L    +  7-  £,         so    that 

/l      id>\  Si(C+^0  £n(k+l)  _ 

fk+1  (t  e1*)   -   2  -  -1 +     g    (C  +  ±  5)   +  0(k   2)    , 

k+1V4         y  k  +  1  (k+1)2       2  k 

where        5  "   §,     .      Using   the   expansion 

§1(C  +  k1  ?)    =   Sl("}    +  k  Sl(?)   +    **' 
g2(C  +  £  5)   -  g2(?)   +|  g2'(5)   +   •••        , 

-i--  i         X  1       1_       1_ 

k+1       k     1+k-l  k"k2       k3  "    '••      • 

£n(k+l)    =  £n  k  +     —     -      •••      ,      we   obtain   after   collecting   terms 


(2.29 


11 


fk+lU  e      )=   2  ~  k31U)   +^TS2(° 

+    -     [g^O  -  5g[<5>]  -    —  [2g2(0  -  EgjCO]  +  o(k"2)    . 
k2  k3 

the  other  hand   from   (2.10)  f k+±  (  j  e1*)   =   1  +  -|  e1*   f£  (^  eX<|)) 


On 

tf--rx 
1- 


M  ....  M/    -  -,  -,/  * 


2 


(2.30) 


whence  using    (2.29)    and  e*   =   ev  =   1  +  (  _L  )       +•••,        we  have 

.         /I     i<j»\        .        1        ...        In  k 

fk+lUe    V   =   2  "  kSl(0   +  ~  S2(^ 

k 

+     ~,     t  T  g,2(0    +   52]    "  ^     -  g-,(?)g2(0    +   0(k'2) 
k2        *      X  k  2 

Comparing  terms  of  (2.30)  and  (2.31)  we  obtain 

gx(0  -  SgJU)  =  4  gx(5)  +  £ 

2g9(C)  -  5g2(5)  =  2  81(5)g2(5)   » 

with  initial  conditions  g, (0)  =  g~(0)  =  4   from  (2.28).   These  equations 

have  a  solution 

1  1—2 

g1(C)  ■  2?  cot  y  ?   ,   g2(0  =  ?2(sin  2  £) 

which,  when  substituted  back  into  (2.29),  gives 

P    /l     ±$\      0        2    r        .    1  _.         -en  k       2,    .      1     ,-2  _,.  -2N 

k  \ "4  e     )  ~  k  C  CO     2   ?         — 2~~     ?    (sin  ~2   C)  +    0(k     ) , 

k 


(2.31) 


c2        -^2  IaI    ,  (tz1^' 

where      £     -   i^k      ,       |  $  |    < 

Next  fk(  4   e      J  "   fk-l  {  4   e      J  =  k=I  Vl   COt   2   Ck-1 

.2,     „_  1  -         £n_k   e2    ,  ,      1   .     ,-2 

k  Ck  cot  2  ^k  +  —  ^k   (sxn  2   ^k  } 

In    (k-1)    .2  .      1  .-2  n/1-3v 

T-  sv    ,     (sxn  -=■  ii        )  +     0(k     )       , 

(k-1)2        k  X  2     k-i 
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(2.32) 


1  k-1  E 

and  setting     ^_±  =5-^C,C=?k      ,    so   that     -j— j-     -     £     ,      and  using 

the   expansions 

cot  -j  (?  -  £  S)    =   cot  J  C  +  2k~"  ^sin  I  ?) 

2 

5  1  1        2 

cot  -r  5    (sin  ■=■  O        +    ••  •      , 


we  obtain 


4k2  2    —  2 


-en    (k-1)    =  In  k  -  £  +   •  •  •      ,        and 


1  1       \~^  1       -z"  1  1 

sin  2"  (£  -  k"  5)  1  =      ^sin  ~2   ^      ^  +  k   ?   COt  I  ?  +   *  *  *    ' ' 


-    /  1     i<jA       .         /  1     i<j>  \       1     r2,    .      1   _.-2 


In  k      3        .    1    _    ,    .      1    r.-2    ,      C      /    •      1   cx-2 
— —  ?      cot  "J  £    (sin  J  5)        +     ~ 3    (sin  -j  C) 

k  k 

c3  1  1-2  -3 

-*-     cot  f  g    (sin  f  5)        +  0(k   J)    . 
3  2  2 


5     N  2 


4k- 
However,    for     c,1  =   itf>kz    ,      |<|>f    <  (  """"  L"  )  ,   we  have 

1  2  i  1      i~2 

£    cot  -r-  £   =   0(£n  k)       ,       |  sin  y  ?|  -*-  0        as      k-*=°    , 

so   that  we    can  write 

If  G-in         —     s  N     k-  ' 


k     sin 


The  same  expansion  can  also  be  obtained  by  again  substituting  (2.32)  into 

the  recurrence  (2.10).   If  we  now  insert  (2.33)  into  the  integral  in  (2.24) 

and  use  the  substitution  E,2   =  i<£>kz  ,  (2.24)  becomes 

n    2 


k  ./   sxnz  j  5  X  k 
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where  the  path  of  integration  T      as  determined  from  the  condition 


£n  k 


is  (see  Figure  2.2): 


5  -  t(-1  +  i)   for 


ln2k   <  t  •:  0   , 


/2 


g  -  t(1  +  i)    for    0  <_  t  <  -= —  £n2k  . 


FIGURE  2.2 


The  integrand  is  a  meromorphic  function  with  poles  at 


E        =   2mm 
m 


m  =  ±  1  ,  ±  2  ,..., 


(2.35) 


and  corresponding  residues 


•*[, 


a 


r(5  )  -  4  3(2™)   -  2  %  Umn) 

k 


—  (2ra) 

k 


(2.36) 


Before  applying  the  residue  theorem,  we  have  to  close  the  path  of  integra- 
tion, for  instance  by  the  arc  (see  Figure  2.2) 

A:  -19  o     2i  TT     _     TT 

S  =  pke    '        Pk  =  ^nk>   "i9!^  • 

Deforming  the  arc  in  the  neighborhood  of  the  real  axis  so  that  it  passes 

i       |2       2  2 

approximately  in  between  the  two  poles  we  have  from   | sin  zj   =  sin  x  +  sinh  y 

i     1   i-2 

I  sin  —  E\         <_l+n   ,    n>0 

along  the  arc  for  large  enough  k  .   Hence 
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-  e 
k2 


i     Sin     2 


dC 


tt/2        -^ 


1    (l+n)p£ 


2       . 
p.     sin 

k^      k 


de 


tt/2 


<    (l+rOp, 


2n      2 

T  Pk   9 
k2      * 


de    = 


2      2 
(l+n)ir  p^  k 

2n 


n        2 

npk 


l-e 


2       2 

(l+n)ir  pk  k  n        2  8 

<  —  p        =     o(£n  k)        as        k  ■*■ 

2n  k2     k 


(2.3; 


Finally,  applying  the  residue  theorem  to  the  integral  over  the  contour   T  +  A 
and  using  (2.34),  (2.36)  and  (2.37)  we  have  the  asymptotic  expansion 


t(n,k)  =  4 


n+1 


fit? 

m>l 


4         2 
■(2irm)   -  3(2irm) 


11   /O    N2 

kZ 


+   4n0(e     j  +  4  o(  — )  +  4n0(— - 


(2.38; 


From  here  we  can  get  the  distribution  of  heights  of  binary  trees  with  n 

or  equivalently  the  probability  that  a  randomly  selected  binary  tree  has  height 

k  if  all  binary  trees  with  n  vertices  are  considered  equally  likely. 


Calling  this  quantity  p  (k)   we  have 


P(k)  -  ±±^ 
n        C 
n 


(2.39! 


where  C 


=  y      t(n,k)   is  the  Catalan  number  (2.7).   Using  the  asymptotic 
k 


■  n 


expansion    [6]       C      = 


n  3/2 


n  /tt 


+     0     4  n 


n  -5/2 


we  obtain   for   large  k  and  n, 


15 


pn(k)  -  8  fl       B2  J^   [26(2™)"  -  3(2™)  2  ]e  -(2™>2B 

m>l 

(2.40) 

where  3  =  — sr  .   Note  that  if  k  <_  n  ,  which  is  the  case  of  interest,  the 
k 

dominant  0-term  is  the  last  one.   Thus,  if  for  some  arbitrarily  small  5  >  0 

n3/8  +  5      <  k  <  n      ,  (2.41) 


then  as     n  ->  °° 

I——  "I  2 

pn(k)  ~  s/y     82     ^    [23(2™/  -  3(27Tm)2Je  "(27TTIl)    6    .  (2.42) 

m=l 

To  verify   that    (2.42)    is   indeed  a  probability  distribution  note   that  as 
n  ■*■  °° 

32     ^  [23(2™)"  -   3(2™)']  e  "(27Tm)    S   dk    , 
tc  m=l 

where  the  limits  of  summation  and  integration  are  given  by  (2.41).   Making  a 
substitution 

k  =  n± 

*   /  3 

we  have  as     n  +  °° 


/    J-    Y^     i/<>      ^      /     ofl3/2        -(27rm)"3 
4   /it     /        <  (27nn)        /      23  e  dB 

m=l 


-    (2^)2  y     331/2   e  -(2™)2e   d3 
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But 


1/2  &-(2^)h   d3  =  233/2  e-(2^)23 


3/2  -(27rm)23     , 
+   (2"irm)   /  23    e         dp 


0 
and  we  are  left  with 


lim  4/r?  ^  2(27nn)2  B3/2  e 


2  „3/2    ■(2Trm)2S 


3+0. 

m— ± 


/it   J 
0 


u2  e  y   du  -  1 


2         2 
by  the  substitution         u   =  (2iTm)   3 

If  we  denote  K        as  the  random  variable  with  distribution  p  (k)   and  call  the 
n  n 

asymptotic  distribution  function 

F(x)  =  lim  P(  —  <.  x  J  (2.43) 

we  obtain  by   integrating   the   right-hand  side  of    (2.42)    over     0   <_  k.  <_  2x   vn 

by   the   same  method   as    above 

.  2 
it  m 


F(x)    =   4x  3   tt5/2     /_jm2   e       v    A  (2 

m=l 

It  can  easily  be  seen  from  (2.43)  that  for  x  =  oUri)         F(x)  -*■  0   and 
thus  the  asymptotic  distribution  (2.42)  or  equivalently  (2.43),  (2.44)  is 
valid  for  all  k  in  the  range  (2.1)  as  n  ■*■   ».   (See  Appendix,  Figure  A.l). 
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This  is  identical  with  the  distribution  function  obtained  by  Renyi  and 

Szekeres  (see  [8]  p.  506)  for  the  height  of  general  (as  opposed  to  binary) 

trees.   Only  tne  normalizing  factors  in  (3.43)  differs  by  a  constant,  namely 

•Jl    .   More  precisely,  if  j\,         is  the  height  of  a  general  tree  with  n  nodes 

(i.e.  with  no  restriction  on  the  number  of  successors  of  a  node)  then  for 

large  n 

-tt  ~  /2  X*    ,  (2.45) 

n        n 

a  somewhat  surprising  result. 


3.   BINARY  TREES  GENERATED  BY  RANDOM  PERMUTATIONS. 

As  described  in  the  introduction  the  tree  insertion  algorithm  defines  a 
map  which  assigns  to  every  permutation  ir(l,...,n)   a  binary  tree  with  n 
nodes.   If  all  permutations  of  integers   l,...,n  are  considered  equally 
likely  the  resulting  trees  are  referred  to  as  random.   Thus,  every  numerical 
quantity  defined  on  a  binary  tree  becomes  a  random  variable. 

Let  H   be  the  height  of  a  random  binary  tree  with  n  nodes,  i.e., 

generated  by  random  permutations  of  the  first  n  integers.   Clearly,  the 

probability 

P(H   <  k)  =  K   B(n,k)  ,  (3.1) 

n  —      n. 

where  B(n,k)   is  the  number  of  permutations   ir(l,...,n)   mapped  into  trees 
with  height  not  exceeding  k  .   (See  Appendix,  Table  A. 2). 

In  order  to  obtain  an  explicit  expression  for  (3.1)  we  first  need  a 
suitable  indexing  system  for  the  nodes.   A  natural  way  to  do  this  is  to  con- 
sider first  a  full  binary  tree  and  label  its  nodes  by  the  sequence  of 
positive  integers  starting  from  the  root  and  labelling  in  each  subsequent  level 
from  left  to  right.   (See  Figure  3.1). 
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level  1 
level  2 
level  3 


FIGURE  3.1 


Thus  nodes  at  level  j  ,  j  =  1,2,...,  from  left  to  right  have  labels 


2jl+m,   m=0,...,2jl 


(3.2) 


Note  that  the  left  and  right  successor  of  a  node  labelled  x  have  labels 

2x  and   2x  +  1   respectively,  left  successors  always  have  even  labels,  right 

successors  have  odd  labels  (  the  root  being  an  exception) . 

Next  consider  a  fixed  binary  tree  with  n  nodes  and  for  every  label  x 
of  the  form  (3.2)  define 


d(x)  = 


0  if  there  is  no  node  with  label  x  , 

1  +  d(2x)  +  d(2x  +  1)   if  there  is  a  node  with  label  x  . 


(3.3) 


Note  that   d(x)   is  simply  the  number  of  nodes  in  a  subtree  with  root  at  x  , 
in  particular   d(l)  =  n   and   d(x)  =  1  if  and  only  if  the  node  labelled  x 
has  no  successor,  i.e.,  is  a  leaf. 

Lemma:  There  is  a  one-to-one  correspondence  between  binary  trees 
with  n  nodes  and  height  not  exceeding  k  and  the  set  D(n,k) 
of  vectors 

(d(l),d(2),...,d(2k  -  1)) 

with  non-negative  integral  components  satisfying  the  conditions: 
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(1)  d(l)  =  n  , 

(2)  For  all     j  =  l,...,k 

2j_1  <x<  2j-l  =>   0  <  d(x)  <_   2k"j+1  -  1. 

(3)  d(2x)  +  d(2x+l)  >  0  =>   d(x)  =  1  +  d(2x)  +  d(2x+l)  . 

Proof:   Given   a  binary  tree  the  numbers  d(x)   are  uniquely 
defined  by  (3.3).   Property  (1)  is  obvious,  property  (3)  follows  from 
(3.3)  since  nodes   2x  and  2x  +  1  are  successors  of  node  x  ,  hence 
if  there  is  a  node  with  labels  either  2x  or  2x  +  1  there  must  be  one 

with  label  x  .   Property  (2)  is  necessary  since  if  it  were  violated 
then  there  would  be  a  subtree  with  root  at  level  j   having  at  least 
2*c~j+l  nodes  and  the  height  of  the  tree  would  then  exceed  k  .   Con- 
versely, given  a  vector   (d(l) , . . . ,d(2k-l) )   satisfying  (1)  -  (3), 
construct  first  a  complete  binary  tree  of  height  k  and  then  eliminate 
all  nodes  with  labels  x  such  that  d(x)  =  0  .   Properties  (1)  -  (3) 
then  guarantee  that  the  result  is  a  binary  tree  with  exactly  n  nodes. 

A  vector   (d(l) , . . . ,d(2  -1))  e  D(n,k)   can  be  used  to  compute  the  number 

of  permutations  mapped  into  the  tree  corresponding  to  this  vector.   Consider 

a  permutation  ir(l,...,n)  =  (s,,...,s  )   mapped  into  a  binary  tree  with  height 

not  exceeding  k  and  look  at  the  node  into  which  a  particular  symbol,  say 

s.  ,  is  mapped.   Let  x.   be  the  label  of  this  node  and  let  L(s.)   and  R(s.) 

be  the  ordered  subsets  of   (s,,...,s  ),  which  are  mapped  into  the  left  and 

right  subtrees  of  the  node  x.  .   By  the  very  nature  of  the  mapping  if  the 

elements  of   L(s.)  U  R(s.)   are  reshuffled  the  tree  is  not  changed  as  long  as 

the  order  of  elements  within  each  subset  L(s.)   and  R(s.)   of   L(s.)  U  R(s.)   is 

l  i  i       i 

preserved.   But  the  numbers  of  elements  in  L(s.)   and  R(s.)   are   d(2x.) 
and  d(2x.  +  1)   respectively  so  that  the  number  of  permutations  resulting 
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from  such  a  reshuffle  is 


[d(2Xi)  +  d(2x.  +  1)]'.  [d(x.)  -  1]! 


d(2X;L)!  d(2x±  +  1)1        d(2x±)!  d(2Xi  +  1)1 

using  (3.3)  and   d(x.)  >  0   (since   s.   is  mapped  into  a  node  labelled  x.). 
Repeating  this  argument  for  each  node  of  the  tree  corresponding  to 
(d(l) , . . . ,d(2  -1))   gives  the  total  number  of  permutations 

n    [d(x)  -  1]! 


d(2x)I  d(2x  +  1) 


k-1 
where  the  product  is  over  all  d(x)  >  0  ,   x  =  1,...,2    -  1.   From  here 

using  the  lemma  and  (3.1)  we  obtain  the  formula 

p(H    ;  k)  =  ^    2j      n  Jo(w  "i]'    ,      .  (3.4: 

n'  d£^,k)    d(2x)-  d(2x+D- 

Note   that   if      d'    =    (d'(l)  ,.  ..,d'(2k  -1))      is   obtained   from     d  =    (d(l)  ,  . .  .  ,d(2k- 

by 

(  d(x)      if     d(x)    >   0 
d'(x)      =      {  ,  (3.5; 

(   1  if     d(x)    =   0 

the  formula  (3.4)  takes  on  a  simpler  form 

V-   /  2k-l       ^ 
P(HU)   =    Z-/      n    d*(x)  .  (3.6; 

n        deD(n,k)  V  x=l 

Unfortunately,  except  in  a  few  special  cases,  the  set   D(n,k)   is  quite 
complicated  for  (3.6)  to  be  useful  for  computation.   Again  a  recurrence  re- 
lation may  be  preferable. 

Indeed,  such  a  recurrence  is  quite  easy  to  derive.   Consider  the  left 
and  right  subtree  of  the  root  of  a  random  binary  tree  with  n  nodes.   If 
J   is  a  random  number  of  nodes  in  the  left  subtree  then  we  have 

H  ,  ,  =  1  +  max  {HT  ,  H      T}  .  (3.7; 

n+l  j    n— J 
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Now  J   is  simply  the  number  of  symbols  in  a  random  permutation 

tt(1,  . . .  ,n+l)  =  (s, ,...,s  , -I )  >  which  are  less  than  s,  .   Therefore,   J   is 

uniformly  distributed  over   {0,...,n},  and  H        and  H  are  conditionally 

■j         n~*  j 

independent  given  J  .   Consequently 

P(H    -  1  <  k  |  J  =  j)  =  P(H.  <  k)P(H   .  <  k)  , 

n+i     —  j  —      n-j  — 

from  which  by  calling 

F(n,k)  =  P(Hq  <  k)  (3.8) 

and  taking  the  expectation  we  obtain  the  recurrence 


F(n+l,k+l)  =  ^j-   /  ,F(j,k)F(n-j,k) 

j=0 


valid  for      n  >_   0   ,   k  >_  0     if  we  define        /  (3.9) 

1  if  k  =  0  , 
F(0,k)  = 

"  0  if  k  >  0 

It  may  be  noted   that    (3.9)    gives 


F(n,k)    =   1     for     0   <_  n   <_  k      ,      and 

F(n,k)  =  0   for    n  :•  2K    ,   as  expected 


(3.10) 


Now  (3.9)  can  be  used  to  calculate  the  numbers   F(n,k)   for  moderate  values 

of  k  and  n  , (see  Appendix,  Tables  A. 2  and  A. 3,  and  [2]),  however  the  memory 
requirements  increase  rapidly. 

It  can  also  be  used  together  with  (3.10)  for  some  special  choice  of  n 

and  k  .   For  instance  for  n   close  to   2  -1  we  obtain 

k       2k"j 

F(2k-l,k)  =   fl  (    .1  ^      ,   k  >  1  ,  (3.11a) 


j  =  l\2j-l 
F(2k-2,k)  =  (2k-l)F(2k-l,k)   ,   kil  ,  (3.11b) 

F(2k-3,k)  =  (2k-l)(2k"1-l)F(2k-l,k)  ,   k^2  ,  (3.11c) 
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and  a  few  more.   Note  that  (3.11a)  is  the  probability  of  obtaining  a  full 
tree  of  height  k  .   However,  these  few  terms  are  of  minor  interest. 

Another  possible  approach  is  to  use  generating  functions.   Defining 

CO 

fk(x)   =  2^F(k,n)xn   >   kl°>  (3-12 

n=0 


we  have  immediately  from  (3.9)  the  relation 


fk+1(x)  =  1  +   /  fk2(y)dy   ,   fQ(x)  =  1  .  (3 

Note  that   fi.00   are  again  polynomials,   f,  (0)  =  1  ,  and  that  for  all  x  e    (0,1] 
fk(x)  <  fk+1(x)  ■*■   (l+x)~   as   k-*«  . 

It  is  hoped  that  (3.13)  can  be  used  to  obtain  an  asymptotic  distribution  of 

the  heights   H   .   However,  we  have  not  been  successful  in  that  respect 
°      n 

to  date. 

It  has  been  suggested  to  us  by  A.  Washburn  that  a  lower  bound  on  the 

expected  height  E{H  }   can  be  easily  obtained  from  (3.7).   Taking  expecta- 

n 


tion  we  have 


E{tfn+1>   =   1  +  EdnaxCHj    ,    tf^)  }  (3.14) 


and   conditioning   upon     J 


E{max(HJ,   HQ-J)  |    J-j  }  1  max(E{Hj  }  ,    E{Hn_.»     . 


Since  J   is  uniformly  distributed  over   {0,...,n}   this  implies 


E{max(HJ,  Hn_j)}  >  ^-   2^  max(E{Hj}  ,  E{H   })  .       (3.15) 

3=0 

Hence  if  a   ,  n  =  0,1,...,   is  a  sequence  of  numbers  defined  recursively  by 
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n 


a  j.i  ■  1  +  — rr     7      max(a.  ,a   .)  ,   an  =  0  ,  (3.16) 

n+ 1       n+1  /    j  j   n-j      0 


j=0 

it  is  easily  seen  from  (3.14)  and  (3.15)  that 


E{H  }  >  a     for  all    n  =  0,1,...  (3.17) 

n  —  n 


Note  that   a   is  a  strictly  increasing  sequence  so  that 


2  /     a-    f°r  n  °dd  j 
n  I    n/2<j<n 


2^max(a.,an_j) 
j=0 


a    +2   /    <*•    f°r  n   even. 
n/2      Z—v   J 

n/2<j<n 


Since  clearly   a  =  0(n)   we  have  as   n  -*■  <*> 


a  —  1  + 
n 


n/2<j<n 


which  upon  approximating  the  sum  by  an  integral  yields 

1 
a(t)  ~  1  +  2  /  a(ty)dy  .  (3.18) 

•1/2 

On  the  other  hand,  let  Y  be  a  random  variable  defined  by 

nY     =  J  n~J 

(n-J         if  tfT   <  H      _    . 

x  J  n-J 

Then 

nY      <     max{J,n-J}    ,  (3.19) 
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and  by  (3.7) 


n+1  nY 

Now  for  large  n  ,  Y   is  uniformly  distributed  over   (  -=■  ,  1)  ,  and  since 

y(n)  -  E{Hn) 

is  an  increasing  function  of  n  we  get  from  (3.20)  by  conditioning  on  Y   and 

applying  (3.19)  the  asymptotic  inequality 

1 
u(n)  <  1  +  2  J        y(ny)dy  (3.2 

"T./2 

valid  as   a  ■*  °°  .   But  this  together  with  (3.17)  and  (3.18)  indicates  that 
we  should  have  in  fact 

u(n)  —  a(n)    as   n  -*-  »  (3.2; 

where   a(t)   is  a  solution  of  (3.18).   It  is  easily  verified  that  (3.18)  has 

a  solution 

/   \  £n  t 

a(t)  =  nn  +a  ' 

where     a      is   an   arbitrary   constant.      But   as      u  (n)    ■*■  »      the    constant   can   simply 
be   disregarded   and  we  have   an  asymptotic   equivalence 

u(n)  ~   (1  -  In  2)"1  -en  n  -   3.25889  In  n  (3.2: 

as      n  -*■  °°    .      (See  Appendix,   Tables  A. 2    and   A. 3). 
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4.   FURTHER  RESULTS  FOR  RANDOM  BINARY  TREES. 

We  now  turn  our  attention  to  some  other  quantities  associated  with  random 
binary  trees.   In  particular  we  wish  to  study  the  process  of  growing  the  tree 
as  the  number  of  nodes  increases.   A  convenient  way  to  visualize  such  a  process 
is  to  consider  a  sequence  2L,X„,...   of  independent  random  variables  with  a 
common  continuous  distribution  and  take  as  our  permutation  the  ranks  of 
X-.  , .  . .  ,X   for  each  n  =  1,2,...   .   Clearly,  for  each  fixed  n  all  resulting 
permutations  are  equally  likely,  and  each  new  symbol,  i.e.,  rank  of  X  -  ,  is 
uniformly  distributed  over   {l,...,n+l}. 

If  for  some  n  we  have  a  random  binary  tree  then  a  new  symbol  will  be 
mapped  into  a  new  node.   This  new  node  can  appear  in  one  of  n+1  possible 
locations  in  the  current  tree.   Following  Knuth  [6]  we  will  refer  to  these 
locations  as  vacancies  of  the  current  tree.   They  are  depicted  in  Figure  4.1 
as  empty  squares . 


FIGURE  4.1 

Let  for  n  >  1  ,  j  >  2  ,  V   .   be  the  number  of  vacancies  at  level  1   at 
-      -      n,j 

time  n  ,  i.e.,  in  a  random  binary  tree  with  n  nodes.   Clearly  V   .  =  0 


for  j  >  n+1   and 


V   _  +  V   .  +•••+  V    ,   -  n  +  1   .  (4.1) 

n,2    n,3        n,n+l 
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Also,  obviously 


V    =2 
Vl,2   c 


Suppose  now  that  at  time  n   the  vacancies  have  values   v    ,  ...,v    -  . 
rr  n,2      n,n+i 

Since  a  new  node  is  equally  likely  to  fill  any  of  the  current   n+1  vacancies, 
the  probability  that  it  will  fill  a  vacancy  at  level  j   equals 


v 


n  +  1 


But  then 


while 


V  ,  -  ,   =  v   .  -  1 
n+1, 3      n,j 


Vn+l,j+l  =  Vn,j+1  +  2  ' 


=     v 


n+1 , i  n , i 


for   all        i£j    ,    i^j+1 


Thus  if  we  define  a  sequence  of  random  polynomials 


j>2 


V   .  x^  X      ,   n>l  , 


we  see  that  these  polynomials  form  a  Markov  chain  with  transition  probabilities 


n  >  1,   j  =  2,...,n  +  1  ,   and  the  initial  state 


W..  (x)  =  2x  .    In  particular  we  then  have 

n+2 

E^W  ,T(x)  |  W  (x)l  =  W  (x)  +  Y^  xj_1(2x-l)  ~^-r 
I  n+1     '   n    I      n       '    ■»  n  +  1 


J-2 


-  Wn(x) 


1  + 


2x-l 
n  +  1 


,   n  >  1 


Consequently 


■Kw}-(i+¥)(i^)  -  (i  +  ^f=i) 


n+1  r- 


n. 


r  2x(2x+l)  •••  (2x+n-l) 


"   n!   2-/ 


J-2 


n 


j-l 


(2x) 


j-l 


(4.2) 


27 


where       are  Stirling  numbers  of  the  first  kind  in  Knuth's  definition  and 
notation  [6].   Thus 


EJV  .)    -     £ 

(  n,j  )  n! 


j-l 


j  =  2,. .. ,n  +  1,   n  :■  1  , 


(4.3) 


gives  us  the  expectation  of  the  number  of  vacancies  at  various  levels.   (See  Appendix, 
Table  A. 5).   Other  quantities  of  some  interest  may  be  obtained  from  here. 
For  instance  the  expected  distribution  of  the  number  of  vacancies  over  levels 
is  immediate 


j-l 


'n^=nTIE{Vj}=  TnW: 


n 
U-l. 


(4.4) 


j  =  2, . . . ,n  +  1  ;  n  >_  1  . 

Next  let  K   be  the  number  of  comparisons  needed  to  insert  a  new   (n+l)st 
symbol  into  a  binary  tree  by  the  algorithm  described  in  the  introduction. 
Then  the  probability 


[  \V 

P(K  =  k)    =  E<P(K  =  <|Wn(x))V=  E  j-H* 
n         [        n    ■  n    )  j    n+ 


K+l 


n+1 


k=1, . . . ,n  ,   n  >  1  , 


=  vn(<  +  1)  , 


since  the  number  of  comparisons  is  k      if  and  only  if  the  symbol  fills  a 


vacancy  at  level  <+l  .   Recalling  that  the  Stirling  number 


is  also  the 


number  of  permutations  of  n  symbols  with  exactly   <   cycles  we  can  write 

2K 
P(K   =  <)   =  -~-  q  (<)  , 
n         n+1  ^n 

where  q  (<)   is  the  probability  that  a  random  permutation  of  n  symbols  has 

<   cycles.   (One  wonders  whether  the  number  of  comparisons  can  be  related  to 

the  number  of  cycles.)   (See  Appendix,  Table  A. 7). 
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With 


P(K  =  k)  =  ,    ...   , 
n        (n+1) ! 


<  =  1, . . . ,n  , 


the  generating  function  (by  definition  of  the  Stirling  number)  is 

n 

n   2x  +  k-1 


K  =  l 


Hence  k   is  a  convolution,   k 

n  '    n 

pendent  Bernoulli  variates: 


X-.  +  •  •  •  +  x   ,  where  the  x  's   are  inde- 
1  n  < 


1    P.  " 


Consequently 


u  -  E{k  } 
n      n 


<     K+l 

-  £zl 

qK  "  K+l 

n 

P, 


-E 


i—  +  —  + 

1  2   3 


+ 


n+1 


K-1 


=   2(Hn+l  "  D  - 


with  H   representing  the  Harmonic  number  [6]  (See  also  the  Riemann  Zeta 


function  in  Abramowitz  and  Stegun  [1].) 


a     =  Var 
n 


(V  ■  Z 


n 


p   q        =2 

K    K 


K  =  l 


4f  K+1     *-?  (<+d2 

K=l  K=l 


"     2<Hn+l  "   X> 


4<»n+l  "   « 


(2) 

n+1  n+1 


Since  asymptotically  [5] 


Hn  =  Y+£nn+In-+o(\), 


(2)    k2  1  /  l 

H     =7 TT  +  Of  ■—■ 

n     6    n+1     V  2 

n 


with  y   Euler's  Constant,   0.5772..., 
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we  obtain  for   the  number  of   comparisons 


y    ~2{y-l  +  £n(n+l)}   =  2  £n(n+l)    -  0.8456    ••• 
n 

2  9  2 

a  ~2{y+l  -■=?-+  £n(n+l)}  =  2  £n(n+l)  -  3.4253 
n  o 


Since   x   <  1  and  a  -*■  •  ,  the  central  limit  theorem  applies, 

1  < '  —  n  rr     > 


yielding 


x    y2 


P  I  — x  )  -  ^(x)  =  -=-   /  e   -  dy  . 


an         /  ^27 

— oo 


2tt  V 


Also,  by  the  Berry-Esseen  Theorem  (cf.  Feller  [4],  p.  544) 


sup 
x 


k  -  y  \ 


<    6 -a 


3 
a 
n 


where 


n  n 

,  3 
r 


:   =   7   Elx  -p    =7   p  q   (  p   +  q   ]  . 
n    /  j      '  <    k'     JLmJ    k  k   \    k  kJ 

k=1  <=1 

Asymp  t  o  t i  cally , 

r  =  -2  /H(4)  -  l\  +  4  (V3)  -   : 


(C  -  -) 


n      \  n+1    /     \  n+1    /     V  n+1    /    n+l 

h  2 

—■         +  4(0.20205  ...)  -  j-       +  Y  +  £n(n+l)  +  1 


=  £a(n+l)  -  4.714 
so  that  the  bound 


r 
6^=.  -  0 


V  Ai(n+1) 
n       N 

which  gives  a  fairly  high  rate  of  convergence  to  normality  for  the  distribu- 
tion of  the  number  of  comparisons  to  insert  a  new  symbol.  The  results  given 
can  be  used  to  analyze  a  complete  binary  insertion  sort  algorithm  operating 
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on  a  random  string  of  symbols.   Higher  order  expansions  are  also  possible  for 

describing  this  process. 

Further,  let  M     be  the  number  of  nodes  at  level  j   in  a  random  binary 
n,j 

tree  with   n  nodes.   Since  clearly 

M   . ,  +  V   .,,   =  2M   .    ,   j  =  l,...,n  , 


with 


M    =  1  ,   M      =  0   ,   n  >  1  ,   whence 
n,l         n,n+l  — 


n 

l  V^  / 1  \  r_J 

M   .  =  —   /  ^7      V       ,   or  upon  taking  the  expectation 

n,j    2   /  j  \  2  )             n,r+l         v                  &       v 


r=i 


E{M 


J-l 


H,y  aJ   Lu      r 


r-3 


j-l,  .  .  •  ,n  ,   n  :■  1   . 


(4.5 


(See  Appendix,  Table  A. 4). 
Note  that 

,1-1   n,j 


n!  2^1    [r 

r=j 


is  an  expected  fraction  of  nodes  occupied  at  level   j  .   It  may  be  called  an 
expected  relative  thickness  of  the  tree  at  level  j  .   Looking  again  at  Figure 
4.1  we  see  that  we  can  distinguish  between  two  kinds  of  vacancies,  those  which 
are  attached  to  a  leaf  (and  hence  come  in  pairs)  and  those  attached  to  an  in- 
ternal node  of  the  tree.   We  will  refer  to  these  two  kinds  as  twin  and  single 
vacancies  respectively. 

Let  for   n  >  1,  j  >  2,   V.   be  the  number  of  single  vacancies  and   V   . 
-      ~  '   n,j  6  n,j 

the  number  of  twin  vacancies  at  level   j   in  a  random  binary  tree  with  n  nodes 
Clearly 

V«  +  V<2>   -  V   .   s   Vf>  =  0  ,   V<2>  =  2   . 

n,j     n,j      n,j      1,2         1,2 
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If   at   time     n     the  values   of   these  variables   are 

JV  CD  anA  (2)  (2) 

Vn,2    "•"    Vn,n+1  md  \,2    '""   Vn,n+1      • 

then  filling   a  single  vacancy   at   level     j      results    in 
n+l,j  n,j  n+l,j+l       n,j 

with  probability   — 7^-r   ,   while  filling  a  twin  vacancy  at  level  j   results 

^    V(1)     -  v(1)  +  1     V(2)     =  v(2)  -  2    V(2)     =  v(2)    +  2 
a+l,j      ri,j  n+l,j      n,j         n+l,j+l    n,j+l 

with  probability  — — -   .   The  remaining  vacancy  numbers  are  not  changed  . 

If  we  again  introduce  the  random  polynomials 

B<«  .  y,w  xj-i  ,  w(2)  .  y,»)  ^-1   , 

n      /  >  n,j  n      Z— rf  n,j 

j>2     .  j>2 

we  obtain  for  their  expected  values  the  equations 

E|wnii«|  -  st  E|w"1)(x)!  +  ^rE|w"2)(x)!' 

EjwU)    (x){     .     ^     E|w(1)(x)j     +     2^JW«)(X)J    , 
i     n+1  I  n-rl  j     n  )  n+1  |     n  j 

n   >_  1    ,      with     E  J  W^      (x)  >     =   0    ,      E  j  w£2'  (x)  [   =   2x    . 

Since  W  \  W^   '  (x)  >  +  E\  \i        (x) }      =      E  I  W    (x)  [        we   obtain  by   substitution 

J    n  (  j     n  \  I     n       ) 

Eiw£l  w|-  STEk1)(x)!+  3i"K«|  • 


from  which  by  using  (4.2)  we  have 
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n-1 


E  W(1)(x) 
I    n 


n(n-l) 


}   1N   /    r  E  I  W  (x) 
n(n-l)  Z— i  r 


n-1 


r=l 


r+1 


Lj       (r-D!  2-J    I  j-l 
r-1  j-2  L 


(2x) 


j-l 


(4.6) 


n(n+l) 


J-2 


n-1 


E  ^  £    c^yi 


r-J-1 


r 
j-l 


,   n  >  1 


Thus 


E  <  V 


J-l 


(d|. 

n,j  j    n(n+l) 


n-1 


2^      (r-i): 


r-j-1 


J"1  J 


(4.7) 


j  =  2, . . . ,n  ;  n  >  1  ,   while  the  corresponding  expression  for  twin  vacancies  is 
obtained  by  subtracting  (4.7)  from  (4.3)  with  (4.7)  set  equal  to  zero  for 
j  =  n+1  .   (See  Appendix,  Table  A. 6) . 

We  conclude  this  section  by  computing  the  expected  number  of  leaves  in  a 
random  binary  tree  with  n  nodes.   From  the  first  equality  in  (4.6)  the  ex- 
pected number  of  all  single  vacancies  equals 

n-1  n-1 

E  |  W(1)  (1)  }  =  ?•   .,  ^  r  E  W  (1)   =  -7— TT  T^  r(r+l)  , 

/  n      \        n(n-l)   /  j  I     r        n(n-l)   /  > 


r=l 


r=l 


since  by  (4.2)    E  {W  (1)  }=  n+1  .   The  latter  sum  equals  -r   (n+1)   so  that  the 
expected  number  of  all  twin  vacancies  is   n+1  -  -r   (n+1)  .   The  expected  number 
of  leaves  is  clearly  half  the  number  of  twin  vacancies,  that  is   -r   (n+1)  . 
Thus,  in  a  random  binary  tree  on  the  average  about  —  of  the  nodes  are  leaves. 
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Remark:    Having  in  mind  the  process  of  growing  random  binary  trees  as 
described  above  we  can  also  look  at  a  random  process 

N.  -  mini  n  :  H  >  k  (  ,  Nn  -  1  ,  (4.8) 

k      1     n     I     0 

i.e.   the  time  (=  number  of  symbols)  needed  to  grow  a  tree  over  the  height 
k.   (See  Appendix,  Table  A. 8).   From  (4.8)  clearly 

P(Nk  >  n)  =  P(Hn  <_  k)  =  F(n,k) 

so  that  g,  (x)  =  1  -  (l-x)f,  (x)   is  the  ordinary  probability  generating  function 


■k«  -  E 


P(Nk  =  n)xn 


n>l 


of  the  random  variable  N,  .   Denoting 


u(m,k)  =  E  J  \(Nk-l)  •••  (Nk  -  m  +  1) 


the  m-th   factorial  moment  of  N,  and  using  the  fact  that 

(m)  (m-1) 

u(m,k)  =  *—   gk(l)  =  m  —5—  £  (1) 

dx  dx 


we  obtain  by  applying  Leibnitz  formula  to  (3.13)  the  relation 

m 
u(m+l,k+l)  =  ^-  /^l(^)  i(j+l,k)u(m-j+l,k)  , 

j=0 

m  >_  0  ,  k  >_  0  .    In  particular  with  m=0   this  becomes 

Ej\+l(Nk+l-   "j    "   2(Et\>)2    >      ">0   • 


or  2 

Var{Hk+1}    =   2(E{Nk>)      +  I  {  N^  ) 


l-"t"H.lJ 


Unfortunately,    it   is    the   first  moment      E  { N,    }  ,   which   is  hard   to   obtain   for 
large     k    .      (See  Appendix,    Table  A.  7). 
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